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(xi) SEQUENCE' DESCRIPTION: SEQ ID NO : 1 : 



GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCG ATG 

Met 
1 



58 



CCG CGC GCT CCC CGCTGC CGA GCC GTG CGC TCC CTG CTG CGC AGC CAC 

Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser His 

5 10 15 

7 AC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG .GGG CCC 

Tvr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly Pro 

20 25 30 



106 



154 
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CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT TTC CGC GCG 
Gin Gly Trp Arg Leu Val Gin Arc Gly Asp Pro Ala Ala Phe Arg Ala 
35 40 ~ 45 

CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC GCA CGG CCG CCC 
Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro 
50 55 60 65 

CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC CTG AAG GAG CTG GTG 
Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val 
70 75 80 

GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC GGC GCG AAG AAC GTG CTG 
Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val Leu 
85 90 95 

GCC TTC GGC TTC GCG CTG CTG GAC GGG GCC CGC GGG GGC CCC CCC GAG 
Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu 
100 105 HO 

GCC TTC ACC ACC AGC GTG CGC AGC TAC CTG CCC AAC ACG GTG ACC GAC 
Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro. Asn Thr Val Thr Asp 
115 120 " 125 

GCA CTG CGG GGG AGC GGG GCG TGG GGG CTG CTG CTG CGC CGC GTG GGC 
t*u A-c GW Ser Glv Ala Tro Glv Leu Leu Leu Arg Arc Val Gly 
120 ' " 135 * 140 145 



202 



250 



298 



346 



394 



442 



490 



GAC GAC GTG CTG GTT CAC CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG 
Aso Asp Val Leu Val His Leu Leu Ala Arc Cys Ala Leu Phe Val Leu 
150 1^5 160 

GTG GCT CCC AGC TGC GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG 
Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr Gin 
165 ' ' 1^0 * 1^5 

CTC GGC GCT GCC ACT CAG GCC. CGG CCC CCG CCA CAC GCT AGT GGA CCC 
Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly Pro 
130. - 135 190 

CGA AGG CGT CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG 
Arc Arg Arg Leu Gly Cys Glu Arc Ala Trp Asn His Ser Val Arg Glu 



195 



200 



205 



GCC GGG GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG 
Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg Gly 
210 215 220 



225 



GGC AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arc Arg Gly 

230 235 240 

GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG GCC 

Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp Ala 
245 250 255 

CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT GTG GTG 

His Pro Glv Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val 

260 265 270 



538 



586 



634 



632 



730 



778 



826 



874 




TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG GGT GCG CTC 92 2 

Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu 
275 280 285 

TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC CAG CAC CAC GCG 97 0 

Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His Ala 
290 295 300 305 

GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC TGG GAC ACG CCT TGT 1018 
Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys 
310 315 320 

CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC CTC TAC TCC TCA GGC GAC 1066 
Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp 
325 330 335 



.-AG GAG CAG CTG CGG CCC TCC TTC CTA CTC AGC TCT CTG AGG CCC AGC 1114 

Lvs Glu Gin Leu Arg Pro Ser Phe Leu Leu. Ser Ser Leu Arg Pro Ser 
340 345 350 

CTG ACT GGC GCT CGG AGG CTC GTG GAG ACC. ATC TTT CTG GGT TCC AGG 1162 

leu Thr Gly Ala Arg Arg Leu Val Giu Thr lie Phe Leu Gly Ser Arg 
355 360 * 365 

TTC TGG ATG CCA GGG ACT CCC CGC AGG TTG CCC CGC CTG CCC CAG CGC 1210 
Fro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin Arg 



375 



380 



385 



TAC TGG CAA ATG CGG CCC CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG 
Tvr Tro Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala 
390 395 400 



1258 



TAT TGC CCC TAC GGG GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT 
Tin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala 
405 4*10 415 



1306 



TTT GTC ACC CCA GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC 1354- 
Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin Gly 
420. 425 430 

TTT GTG GCG GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG 1402 
Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val 
4 35 4 40 "445 

CAG CTG CTC. CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG 1450 
.Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 
450 455 460 465 

CGG GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 14 98 

Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 
470 475 480 

•TAT AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC CTG . 154 6' 

His Asn Glu. Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser Leu 
485 490 495 

GTG AAG CAT GCC AAG CTC TCG CTG CAG GAG" CTG ACG TGG AAG ATG AGC 15 94 

Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met Ser 
500 505 510 





5 



GIG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT GGC TGT GTT 164 2 

Val Arg Asd Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val 
515 ^ 520 525 

u 

CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG GCC AAG TTC CTG 1690 
Pro Ala Ala Glu His Arg Leu Arg Glu Glu lie Leu Ala Lys Phe Leu 
530 535 540 545 



CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG CTC AGG TCT TTC TTT 
His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe 
550 555 560 

TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG CTC TTT TTC TAC CGG 
Tvr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg 
565 570 575 



1738 



1786 



AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT GGA ATC AGA CAG CAC _TTG 
Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu 
580 ' 585 590 

AAG AGG GTG CAG CTG CGG GAG CTG TCG GAA GCA GAG GTC AGG CAG CAT 
Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His 



595 



'600 



605 



CGG GAA GCC AGG CCC GCC CTG CTG ACG TCC AGA CTC CGC TTC ATC CCC 
~zz Glu Ala Arc Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro 
610 ' 615 620 625 

AAG CCT GAC GGG CTG CGG CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA 
Lys Pro Aso Gly Leu Arg Pro lie Val Asn Met Asp Tyr Vai Val Gly 
630 635 . 640 

GCC AGA ACG TTC CGC AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG 
Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
645 650 " 655 

GTG AAG GCA CTG TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC 
Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro 



660 



665 



670 



1834 



1882 



1930 



1978 



2026 



2074 



2122 



2170 



GGC CTC CTG GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC 

Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg Ala 
. " 675 680 685 

TGG CGC ACC .TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG 

Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu 
69*0 695 700. 705 

CTG TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 2218 

Leu Tyr Phe Val Lys Val Asp Vai Thr Gly Ala Tyr Asp Thr lie Pro 
710 715 720 

CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 22 6 6 

Gin Asp Arg Leu Thr Glu Val lie Ala Ser lie lie Lys Pro Gin Asn 

725 7.30 735 



ACC- TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC CAT GGG 
Thr Tyr Cys Vai Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly 
740 745 750 



2314 
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CAC GTC CGC AAG GCC TTC 
His Val Arg Lys Ala Phe 
755 

CAG CCG TAC ATG CGA CAG 
Gin Pro Tyr Met Arg Gin 
770 775 

CTG AGG GAT GCC GTC GTC 
Leu Arg Asp Ala Val Val 
790 

AGC AGT GGC CTC TTC GAC 
Ser Ser Gly Leu Phe Asp 
805 

-GTG CGC ATC AGG GGC AAG 
Val Arg lie Arg Gly Lys 
820 

GGC TCC ATC CTC TCC ACG 
Gly Ser lie Leu Ser Thr 
S35 

GAG AAC AAG CTG. TTT GCG 
Glu Asn Lvs Leu Phe Ala 
350 * 855 

TTG GTG GA? GAT TTC TTG 
Leu Val Asp Asp Phe Leu 
870 

ACC TTC CTC AGG ACC CTG 
Thr Phe Leu Arg Thr Leu 
335 



AAG AGC CAC GTC 

Lys Ser His Vai 
7 60 

TTC GTG GCT CAC 

Phe Val Ala His 



ATC GAG 
He Glu 



GTC TTC 
Val Phe 



TCC TAC 
Ser Tyr 
825 

CTG CTC 
Leu Leu 
840 

GGG ATT 
Gly He 



CAG AGC 
Gin Ser 
795 

CTA CGC 
Leu Arg 
810 

GTC CAG 
Val Gin 



TGC AGC 
Cys Ser 

CGG CGG 
Arg Arg 



TCT ACC TTG ACA 
Ser Thr Leu Thr 
765 

CTG CAG GAG ACC 
Leu Gin Glu Thr 
780 

TCC. TCC CTG AAT 
Ser Ser Leu Asn 



GAC CTC 
Asp Leu 



GTG 



TGG 
Trp 
9 30 



AAC TTG CGG AAG ACA 
Asn Leu Arg Lys Thr 
900 



GGT 
Gly 
915 



GGC ACG GCT TTT 
Giv Thr Ala Phe 



TGC GGC CTG CTG CTG 
Cvs Giv Leu Leu Leu 
935 



TTG GTG ACA CCT 
Leu Val Thr Pro 
875 

GTC CGA GGT GTC 
Val Arg Gly Vai 
390 

GTG GTG AAC TTC 
Val Val Asn Phe 
905 

GTT CAG ATG CCG 
Val Gin Met Pro 
920 

GAT ACC CGG ACC 
Asp Thr Arg Thr 



TTC ATG TGC CAC 
Phe Met Cys His 
815 

TGC CAG GGG ATC 
Cys Gin Gly He 
830 

CTG TGC TAC GGC 
Leu Cys Tyr Gly 
845 

GAC GGG CTG CTC 
Asp Gly Leu Leu 
8 60 ' 

CAC CTC ACC CAC 
His Leu Thr His 



AGC 
Ser 



GAG 
Glu 
800 

CAC 
His 



CCG 
Pro 
785 

GCC 
Ala 



GCC 
Ala 



CCT GAG TAT GGC 
Pro Glu Tyr Gly 

.395 

CCT GTA GAA GAC 
Pro Val Giu As? 
910 

GCC CAC GGC CTA 
Ala His Gly Leu 
925 



CCG CAG 
Pro Gin 



GAC ATG 
Asp Met 



CTG CGT 
Leu Arg 
865 

GCG AA-A 

Ala Lys 
880 

TGC GTG 

Cys Val 



GAG GCC 
Glu Ala 



TTC CCC 
Phe Pro 



TAC TCC AGC TAT GCC CGG 
Tyr Ser Ser Tyr Ala Arg 
950 

CGC GGC TTC AAG GCT GGG 
Arg Gly Phe Lys Ala Gly 
965 

TTG CGG CTG AAG TGT CAC 
Leu Arg Leu Lys Cys His 
980 



ACC TCC ATG AGA 
Thr Ser He Arc 
955 

AGG AAC ATG CGT 
Arg Asn Met Arr 
970 

AGC CTG TTT CZZ 
Ser Leu Phe Leu 
985 



CTG 
Leu 
940 

GCC 
Ala 



GAG GTG CAG 
Giu Val Gin 



AGT CTC ACC 
Ser Leu Thr 



AGC GAC 
Ser Asp 
945 

TTC AAC 
Phe Asn 
960 



CGC A-AA CTC TTT 
Arg Lvs Leu Phe 
* 97 5 

GAT TTG CAG GTG 
Asp Leu Gin Val 
990 



GGG GTC 
Gly Val 



AAC AGC 
Asn Ser 



2362 



2410 



2458 



2506 



2554 



2602 



2650 



2698 



2746 



27-9' 



2842 



2890 



2938 



2986 



3034 
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CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG GCG 308 2 

Leu Gin Thr Val Cys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin Ala 
995 1000 1005 

TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG CAA GTT 3130 
Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin Val 
1010 1015 1020 1025 

TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC TCT GAC ACG GCC TCC 317 8 

Trp Lys Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp Thr Ala Ser 
1030 1035 ' 1040 

CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC GCA GGG ATG TCG CTG GGG 322 6 

Leu Cys Tyr Ser He Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly 
1045 1050 1055 

GCC AAG GGC GCC GCC GGC CCT CTG CCC TCC GAG GCC GTG CAG- TGG CTG 3274 
Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp Leu 
1060 1065 1070 

TGC CAC CAA 'GCA -TTC CTG CTC AAG CTG ACT CGA CAC CGT GTC -ACC TAC 3322 
Cvs His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr 
1075 ~ 1080 1085 

GTG CCA CTC CTG GGG TCA CTC AGG ACA GCC CAG. ACG CAG CTG AGT CGG 3370 
Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser Arc 
1090 1095 1100 1105 

AAG CTC CCG GGG ACG ACG CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG 3418 
Lys LjSU Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro 
1110 1115 1120 

GCA CTG CCC TCA GAC TTC AAG ACC ATC CTG GAC TGATGGCCAC CCGCCCACAG 34 71 

Ala Leu Pro Ser Asp Phe Lys Thr He Leu Asp 
■ 1125 / 1130 

CC AGG CCG AG AG CAG AC ACC AGCAGCCCTG TCACGCCGGG CTCTACGTCC* CAGGGAGGGA 3531 

GGGGCGGCCC ACACCCAGGC CCGCACCGCT GGGAGTCTGA GGCCTGAGTG AGTGTTTGGC 3591 

CGAGGCCTGC ATGTCCGGCT GAAGGCTGAG TGTCCGGCTG AGGCCTGAGC GAGTGTCCAG 3651 

CCAAGGGCTG AGTGTCCAGC ACACCTGCCG TCTTCACTTC CCCACAGGCT GGCGCTCGGC 3711 

TCCACCCCAG GGCCAGCTTT TCCTCACCAG GAGCCCGGCT TCCACTCCCC AC AT AG G AA T 3771 

AGTCCATCCC CAGATTCGCC ATTGTTCACC CCTCGCCCTG CCCTCCTTTG CCTTCCACCC 38 31 

CCACCATCCA GGTGGAGACC CTGAGAAGGA CCCTGGGAGC TCTGGGAATT TGGAGTGACC 38 91 

AAAGGTGTGC CCTGTACACA GGCGAGGACC CTGCACCTGG ATGGGGGTCC CTGTGGGTCA 3951 

AATTGGGGGG AGGTGCTGTG GGAGTAAAAT ACTGAATATA TGAGTTTTTC AGTTTTGAA-A 4 011 

— — 4015 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
1 5 10 15 

His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly 
20 25 30 

Pro Gin Glv Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
3 5 " 4 0 4 5 

Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
50 55 60 

=>-o Pro &1- Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu 
65 - 70 75 80 

Val Ala Arc Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val. 

85 90 95 

Leu Ala Phe Glv Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro 
100 105 110 

-Glu Ala Phe Thr Thr Ser Val Arc Ser Tyr Leu Pro Asn Thr Val Thr 
115 120 " 125 

Aso Ala Leu Arc Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val 
130 ' 135 140 

Glv Aso Asc Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val 
145 * " 150 155 160 

Leu Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr 
165 170 175 

Gin Leu Glv Ala Ala* Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 
1.8 0 185 * 190 

Pro Arg Arc Arg Leu Gly Cys .Glu Arg Ala Trp Asn His Ser Val Arg 
195 200 205 

Glu Ala Glv Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 " 215 220 

Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro. Arg Arg 
225 230 235 240 

. Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 
245 250 255 

Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val 
260 265 270 




Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 280. 285 

Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His 
290 295 300 

Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 
305 310 315 320 

Cys Pro Pro Val Tyr. Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 
325 330 335 

Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 
340 345 350 

Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr He Phe Leu Gly Ser 
355 360 365 

Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin 
370 375 380 

Arg Tyr Trp Gin Met Arg -Pro Leu Phe Leu Glu Leu Leu Gly Asn His 
335 - 390 395 400 

Ala Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg 
405 4 10 4 15 

Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 
420 '425 430 

Giy~Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu 
435 440 445 

Val Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe 
450 455 - 460 

Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser 
4 6 5 4 7 0 415 4 8 0 

Arc His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe He Ser 
435 490 495 

Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500. 505 510 . 

Ser Val Arg Aso Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys 
515 * '520 525 

Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu He Leu Ala Lys Phe 
530 535 540 

Leu His Tro Leu Met Ser Val Tyr val Val Glu Leu Leu Arg Ser Phe 
545 * 550 555 560 

Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr 
565 570 ' 575 

Arg Lys Ser Val" Trp Ser Lys Leu Gin Ser He Gly He Arg Gin His 
580 * 535 590 
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Leu Lys Arg Val Gin Leu Arg Glu Leu Ser" Glu Ala Glu Val Arg Gin 
5S5 600 605 

His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie 
610 615 620 

Pro Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val 
625 630 635 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 655 

Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg 
660 665 670 

Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg 
675 680 685 

Ala Tro Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro 
690 695 700 

Giu Leu Tvr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He 
705 * 710 715 720 

Pro Gin Aso Arc Leu Thr Glu Val He Ala Ser He He Lys Pro Gin 
* ~ 725 730 735 

A<r. Thr Tyr Cys Val Arg Arg Tyr .Ala Val Val Gin Lys Ala Ala His 
740 ' 745. 750 

Giv His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp 
755 760 765 

Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser 
770 " " 775 780 

Pre Leu Arc Aso Ala Val Val He Glu Gin Ser Ser Ser Leu Asn Glu 
7 = 5 ^ 790 795 800 



Ala 3er Ser Glv-Leu Phe Aso Val Phe Leu Arg Phe Met Cys His His 
" 305 " 810 815 

Ala Val Arc lie Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly lie Pro 
820 825 . 830 

Gin Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp 



835 



840 



845 



Met Giu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu 

850 855 860. 

Aro Leu Val Aso Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala 

665 * 370 875 830 

Lvs Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys 

885 S90 895 

Val Val Asn Leu Arg Lvs Thr Val Val Asn Phe Pro Val Giu Asp Glu 

900 ' 905 910 




1 1 



GW Giv Thr Ala Phe Val Gin Wet Pro ? 
qn c " 920 



.ia His' Glv Leu ?ne 



Bro t rp cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser 
930 «5 940 

Aso Tyr Ser Ser Tyr Ala Arg Thr Ser lie Arg Ala Ser Leu Thr Phe 
945 • 950 955 

Asn Arg Gly Phe Lys Ala Gly Arg Asn. Met Arg Arg Lys Leu Phe Gly 

965 970 

val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn 
980 985 99U 

S-r Leu Gin Thr Val Cys Thr Asn He Tyr Lys lie Leu Leu Leu Gin 
995- 100° 100 - 

... Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pre Phe His Gin Gin 
1010 1015 102° 

T ^o Lys Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp Thr Ala^ 
1025 '. - 1030 1035 

Ser Leu Cys Tvr Ser lie Leu Lys Ala Lys Asn Ala Gly Met Ser Le 
1045 1° 50 105:3 



Pro Ser Glu Ala Val Gin Trp 

1060 



GW Ala Lvs Glv Ala Ala Gly Pro Leu 

1065 10/u 



, eu evs His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr 



1075 



1080 1° 35 



--■ val p:o Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser 
io90 1095 ^ - 1100 

Leu Thr Ala Leu Glu Ala Ala Ala Asr. 



Ara Lvs Leu Pro Gly Thr Tnr , , , - ■ 

1105 ' mO 1115 

= -o Pi a L-u Pro Ser Asp Phe . Lys Thr He Leu Asp 
1125 - H30. 



{2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(B) LOCATION: 1..2176 

( D) OTHER INFORMATION : /noce= "clone ll^bbZ 



11 
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(i>:) FEATURE: 

(A) NAME /KEY: COS 

(3) LOCATION: 23. . 802 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GGCCAAGTTC CTGCACTGGC TGATGAGTGT GTACGTCGTC 
TTATGTCACG GAGACCACGT TTCAAAAGAA CAGGCTCTTT 
GAGCAAGTTG CAAAGCATTG GAATCAGACA GCACTTGAAG 
GTCGGAAGCA GAGGTCAGGC AGCATCGGGA AGCCAGGCCC 
CCGCTTCATC CCCAAGCCTG ACGGGCTGCG GCCGATTGTG 
AGCCAGAACG TTCCGCAGAG AAAAGAGGGC CGAGCGTCTC 
GTTCAGCGTG CTCAACTACG AGCGGGCGCG GCGCCCCGGC 
GGGCCTGGAC GATATCCACA GGGCCTGGCG CACCTTCGTG 
CCCGCCGCCT GAGCTGTACT TTGTCAAGGT GGATGTGACG 
CCAGGACAGG CTCACGGAGG TCATCGCCAG CAT CATC AAA 
GCGTCGGTAT C-CCGTGGTCC AGAAGGCCGC CCATGGGCAC 
CCACGTCCTA CGTCCAGTGC CAGGGGATCC CGCAGGGCTC 
GCAGCCTCTG CTACGGCGAC ATGGAGAACA AGCTGTTTGC 
7 3CTCCTGCG TTTGGTGGAT GATTTCTTGT TGGTGACACC 

c:ttcctcag gaccctggtc cgaggt'gtcc CTGAG~ATGG 

AGACAGTGGT GAACTTCCCT GTAGAAGACG AGGCCCTGGG 
7GCCGGCCCA CSGCCTATTC CCCTGGTGCG GCCTGCTGCT 
TGCAGAGCGA CTACTCCAGC TATGCCCGGA CCTCCATCAG 
GCGGC7TCAA GGCTGGGAGG AACATGCGTC GCAAACTCTT 
GTCACAGCCT GTTTCTGGAT TTGCAGGTGA ACAGCCTCCA. 
ACAAGA.TCCT CCTGCTGCAG GCGTACAGGT TTCACGCATG 
ATCAGCAAGT 77GGAAGAAC CCCACATTTT TCCTGCGCGT 
TCTGCTACTC CATCCTGAAA GCCAAGAACG CAGGGATGTC 
CCGGCCNTCT GCCCTCCGAG GCCGTGCAGT GGCTGTGCCA 
TGACTCGACA CCGTGTCACC TACGTGCCAC TCCTGGGGTC 
AGCTGAGTCG GAAGCTCCCG GGGACGACGC TGACTGCCCT 
CACTGCCCTC AGACTTCAAG ACCATCCTGG ACTGATGGCC 



GAGCTGCTCA 
TTCTACCGGA 
AGGGTGCAGC 
GCCCTGCTGA 
AACATGGACT 
ACCTCGAGGG 
CTCCTGGGCG 
CTGCGTGTGC 
GGCGCGTACG 
CCCCAGAACA 
GTCCGCAAGG 
CATCCTCTCC 
GGGGATTCGG 
TCACCTCACC 
CTGCGTGGTG 
TGGCACGGCT- 
GGATACCCGG 
AGCCAGTCTC 
TGGGGTCTTG 
GACGGTGTGC 
TGTGCTGCAG 
CATCTCTGAC 
GCTGGGGGCC 
CCAAGCATTC 
.ACTCAGGACA 
GGAGGCCGCA 
ACCCGCCCAC 



GGTCTTTCTT 60 

AGAGTGTCTG 120 
TGCGGGAGCT - 180 

CGTCCAGACT 24 0 

ACGTCGTGGG 300 

TGAAGGCACT 3 60 

CCTCTGTGCT 4 20 

GGGCCCAGGA 4 80 

ACACCATCCC .54 0 

CGTACTGCGT 600 

CCTTCAAGAG 660 

ACGCTGCTCT 7 20 

CGGGACGGGC "?8 0 

CACGCGAAAA 84 0 

AACTTGCGGA 900 

TTTGTTCAGA 960 

ACCCTGGAGG 10 20 

ACCTTCAACC 1080 

CGGCTGAAGT 114 0 

■ACCAACATCT 1200 

CTCCCATTTC 1260, 
ACGGCCTCCC ' 1320 

AAGGGCGCCG 138 0 

CTGCTCAAGC 14 4 0 

GCCCAGACGC 1500 

GCCAACCCGG 15 60 

AGCCAGGCCG 162 0 




13 



nor. or^n.£CA 


CCAGCAGCCC 


TGTCACGCCG 


GGCTCTACGT 


CCCAGGGAGG 


GAGGGGCGGC 


1660 


V, i-.U. H. \^ \^ rtva 


HCCTGCACCG 


CTGGGAGTCT 


GAGGCCTGAG 


TGAGTGTTTG 


GCCGAGGCCT 


1740 


oV^r*. i I *wV^uo 


CTG AAGGCTG 


AGTGTCCGGC 


TGAGGCCTGA 


GCGAGTGTCC 


AGCCAAGGGC 


1800 




CCAf ACCTGC 


CGTCTTCACT 


TCCCCACAGG 


CTGGCGCTCG 


GCTCCACCCC 


1860 




TTTCCTCACC 


AGGAGCCCGG 


CTTCCACTCC 


CCACATAGGA 


ATAGTCCATC 


1920 




CCATTGTTCA 


CCCCTCGCCC 


TGCCCTCCTT 


TGCCTTCCAC 


CCCCACCATC 


1980 


CAGGTGGAGA 


CCCTGAGAAG 


GACCCTGGGA 


GCTCTGGGAA 


TTTGGAGTGA 


CCAAAGGTGT 


2040 


GCCCTGTACA 


CAGGCGAGGA 


CCCTGCACCT 


GGATGGGGGT 


CCCTGTGGGT 


CAAATTGGGG 


2100 


GGAGGTGCTG 


TGGGAGTAAA 


ATACTGAATA 


TATGAGTTTT 


TCAGTTTTGN 


AAAAAAAAAA 


2160 
2176 



AAJ^JLAAAAAA AAAAAA 



(2) INFORMATION FOR SEQ ID NO : 4 : 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3855 base pairs 
{ 3 } TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(i:i) FEATURE: 

(A) NAME /KEY : - 

[B) LOCATION: 1..3355 - . 

/ D) OTHER INFORMATION: /note- "nucleic acid sequence with an 

open reading frame encoding a celta-182 
variant polypeptide" 

{!:•:) FEATURE: " 
(A) NAME/KEY: CDS 
(3) LOCATION: 56. - 2479 

( o ) OTHER INFORMATION: /product= "delta-132 variant 

polypeptide" 



(xi) SEQUENCE DESCRI PT ION : . SEQ ID NO : 4 : 
GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCG AT< 



CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC CA- 
Dr 0 ^ q Ala Pro Arc Cvs Arg Ala Vai Arg Ser Leu Leu Arc Ser Kis 
5 " - 10 15 

~*C CGr G*G GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG GGG CCC 
T vr arg Glu'Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly Pro 
" " 20 25 30 



58 



106 
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C^G GGC TGG CGG CTG GTG CAG CGC GC-G GAC CCG GCG GCT TTC CGC GCG 
Gin Gly Trp Arc Leu Val Gin Arg Giy Asp Pro Ala Ala Pne *rg Ala 



35 



40 



45 



CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC GCA CGG CCG CCC 
L lu vll S Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro 
50 55 60 



65 



CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC CTG AAG GAG CTG GTG 
Pro S Ala Pro- Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val 
70 75 80 

GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC GGC GCG AAG AAC GTG CTG 
Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val. Leu 



85 



90 



GCC TTC GGC TTC GCG CTG CTG GAC GGG GCC CGC GGG GGC CCC CCC GAG 
■Ma Phe Gly Phe Ala Leu Leu Aso Gly Ala Arg Gly Gly Pro Pro Glu 
100 105 HO 

GCC TTC ACC ACC AGC GTG CGC AGC TAC CTG CCC AAC ACG GTG ACC GAC 
Ala ?he Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Tnr Val ihr Asp 



115 



120 



125 



GCA CTG CGG GGG AGC GGG GCG TGG GGG CTG CTG CTG CGC CGC GiG GGC 
t ou ^ C civ Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val Gly 
V30 - * 135- 140 . i45 

G^ G^C GTG CTG GTT CAC CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG 
to to "l Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val Leu 
150 155 

rrr aGC TQC GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG 
J 2 oZ' 0 s* r cys Ala Tyr Gin Val Cys Gly Pro Pro Leo Tyr- Gin 
— •- . e5 170 _ 1 7 = 

r-T- G G<~ r.rr C r C ACT CAG GCC CGG CCC CCG CCA • CAC GCT AGT GGA CCC 
leu Giy Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly Pro 

130 185 190 

CGA AGG CGT CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG 
P-g L-u Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg Glu 
' 195 200 - 205 

GCC GGG GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG 
ai* Giy val ?ro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg Gly 
210' 215 220 225 

GGC AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG. CGT GGC 
Glv Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg ^rg Gly 
230' 235 240 

GCT G rc CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG GCC 
^. S '*ia Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser 1 rp Ala 
245 250 2d5 

CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CCT GGT TTC TGT GTG GTG 
His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly ?ne Cys V«l Val 



202 



250 



298 



346 



394 



44 2 



490 



538 



586 



634 



682 



730 



778 



826 



874 



260 



265 
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TCA CCT 
Ser Pro 
275 

TCT GGC 
Ser .Gly 
290 

GGC CCC 
Gly Pro 



CCC CCG 
Pro Pro 



AAG GAG 
Lvs Glu 



GCC 



ACG 
Thr 



AG A 
Arq 

CGC 
Arg 



CCC 
Pro 



CAC 
His 



CCA 
Pro 



TCC 
Ser 



CTG ACT- 
Leu Thr 
355 

CCC TGG 

Pre Trp 
370 

TAG TGG 

7 v - 7 r o 



C AG 7GC 
Gin Cvs 



GCC- GTC 



GTG 
Val 



CAG 
Gin 
340 

GGC 
Gly 



TAC 
Tyr 
325 

CTG 
Leu 



GCT 

£ 1 a 



AC A 
Thr 
310 

GCC 
Ala 



GCC GAA 
Ala Glu 
280 

TCC CAC 
Ser His 
295 

TCG CGG 
Ser Arg 



CGG 
Arg 

CGG 
Arg 



ATG 



CCA 
Pro 



GGG 
Giv 



7C7 G7G 
Ser Val 
. 435 

CAG C7G 
Gin Leu 
4 50 

CGG GCC 
Arc Ala 



Gin 



CCC 
Pro 



ACC 
7hr 
420 

GCG 

r. I - 



A7G 



CGG 
Arg 
390 



GAG ACC 
Glu Thr 



CCC TCC 
Pro Ser 



AGG. CTC 
Arg Leu 
360 

ACT CCC 
Thr Pro 
375 

CCC C7G 
Pro Leu 



GAA GCC ACC 7CT 7TG GAG GGT GCG CTC 
Glu Ala Thr Ser Leu Glu Gly Ala Leu 
285 

CCA TCC GTG GGC CGC CAG CAC CAC GCG 
Pr-o Ser Val Gly Arg Gin His His Ala 
300 305 

CCA CCA CGT CCC TGG GAC ACG CCT TGT 
Pro Pro Arg Pro Trp Asp Thr Pro Cys 
315 • 320 

AAG CAC TTC CTC TAC TCC TCA GGC GAC 
Lys His Phe Leu Tyr Ser Ser Gly Asp 
330 335 

TTC CTA CTC AGC TCT CTG AGG CCC AGC 
Phe Leu Leu Ser Ser Leu Arg Pro Ser 
345 350 

GTG GAG ACC ATC TTT. CTG GGT TCC AGG 
Val Glu Thr lie Phe Leu Gly Ser Arg 
365 

CGC AGG TTG CCC CGC CTG CCC CAG CGC 
Arg Arg Leu Pro Arg Leu Pro Gin Arg 
380 > 385 

TTT CTG GAG CTG CTT GGG AAC CAC GCG 
one Leu Glu Leu Leu Gly Asn His Ala 
395 400 



GGG 
Glv 



CTC 
Leu 



TGC 
Cvs 



4 05 

CCA 
Pro 



GCC 



CGC 
Arg 



GCA 
Ala 



CCC 
Pro 



CAG 
Gin 



CTG 
Le u 



CAC AAC 
His Asn 

GGG AAG 
Gly Lys 



GAA 
Glu 



CAT 
His 
500 



CGC 
Arg 
485 



CGC 
Arg 
4 70 

CGC 
Arg 



AAG 
Lys 



•GTG CTC 
Val Leu 



GCC GGT 
Ala Gly 



GAG GAG 
Glu Glu 
440 

CAC AGC 
His Ser 
455 

CGG CTG 
Arg Leu 



TTC CTC 
Phe Leu 



CTC TCG 
Leu Ser 



CTC AA.G ACG CAC TGC CCG CTG CGA GC7 
Leu Lys Thr His Cys Pro Leu Arg Ala 
4 10- 4 15 

GTC TGT GCC CGG GAG AAG CCC CAG GGC 
Val Cys Ala Arg Glu Lys Pro Gin Gly 
425 • 430 

GAG GAC ACA GAC CCC CGT CGC CTG GTG 
Glu Asp Thr Asp Pro Arg Arg Leu Val 
445 

AGC CCC TGG CAG GTG TAC GGC TTC GTG 
Ser Pro Tro Gin' Val Tyr Gly Phe Val 
460 465 

GTG CCC CCA GGC CTC TGG GGC TCC AGG 
Val Pro Pro Gly Leu Trp Gly Ser Arg 
475 430 

AGG AAC ACC AAG AAG TTC ATC TCC CTG 
Arg Asn Thr Lys Lys Phe lie Ser Leu 
490 495 

C7G CAG GAG CTG ACG TGG " AAG ATG AGC 
Leu Gin Glu Leu Thr Trp Lys Met Ser 
505 510 



922 



970 



1018 



1066 



1114 



1162 



1210 



1258 



1306 



. 13 5 4 



1402 



1450 



1498 



1546 



1594 
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«T« COO «C .TCC OCT TGG CTG CGC AGG ACC CCA GCC fcTT GGC TGT CTT 

Val Arg £-sp Cys Ala Trp beu Arc .-.re Ser ..o c.y . . 



515 



520 



525 



- s s s; as s ss s ss s; £ s s s ss s 

530 535 540 

r; s 2S SS 55 !2 £.55 Eg S 2 E E £ K 

550 • 5bb 

m GTC ACG GAG ACC ACG TTT CAA AAG AA.C AGG CTC TTT TTC TAC CGG 
r7- Val Thr Glu Thr Thr Phe Gin Lys Asn Arc, Leu Phe Phe Tyr Arg 
' " 5"? 0 ^ 



565 



,, 0 AGT GTC TGG « « TTG CAA JSC ATT «£ ATC AGA CAG CAC TTG 

Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly lie arg 



580 585 



"I % « S5 S Arg Glu S S SS £ g " ^ K 

5 9 | • 600 605 



^ - n 615 



630 



_ . rrr eg » r.Bft AGG GCC GAG CGT CTC ACC TCG AGG 

? C I £ & Phi " 9 a", gTo Lys AT, M. G!U «, L., Tjr Mr «, 

645 o:)C 

^ r?r rrr :ar T-C GAG CGG GCG CGG CGC- CCC 

GTG AAG GCA CTG TTC aGC G G CiC . - ft prQ 

Lys Ala Leu ?ne Ser. V c l Leu .-.sn ^ ^ ^ 
660 665 

GGC CTC CTG GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC.CAC AGG GCC 

r. W Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp h.s . 

675 680 685 . 

T« CGC ACC TTC GTG CTG CGT GTG CGG GCC GAG GAC CCG CCG OCT GAG 

T.ro Arg Thr Phe Val Leu Arg V„l r..g <-in ^ 

690 695 

CTG TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 
t.-u Tyr Phe Val Lys Val Asp Val Tnr Gly Ala Tyr Asp inr ^ 
710 715 

. .~r- rrr *rr ETC A^C PV. CCC CAG AAC 

CAG GAC AGG CTC ACG GAG GTC ;; C C- C .G TC h.C ^ ^ ^ 

' " 725 . 730 

_„„ „, r _ CT . CGT CGG t.T GCC GTG GTC CAG AAG GCC GCC CAT GGG 

Thr Tyl £ Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala H.s Gly 

740 745 




1690 



1738 



1786 



1834 



1882 



1930 



1978 



2026 



2074 



2122 



2170 



2218 



2266 



2314 
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274 6 
2806 
2866 



C-C r.TC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA GTG CCA GGG 2 362 

hV s val' Arg Lys Ala Phe Lys Ser Kis Val Leu Arg Pro Val Pro Gly 
755 760 765 

G*T CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG CAG CCT GTG CTA 24 10 

As'd Pro Ala Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val Leu 
770 775 780 785 

CGG CGA CAT^GGA GAA CAA GCT GTT TGC GGG GAT TCGGCG GGA CGG GCT 24 58 

A-q Arg His Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg Ala 
790 795 800 

GCT CCT GCG TTT GGT GGA TGATTTCTTG TTGGTGACAC CTCACCTCAC 250 6 
Ala Pro Ala Phe Gly Gly 
805 

CCACGCGAAA ACCTTCCTCA GGACCCTGGT CCGAGGTGTC CCTGAGTATG GCTGCGTGGT 256 6 

GAACTTGCGG AAGACAGTGG TGAACTTCCC TGTAGAAGAC GAGGCCCTGG GTGGCACGGC 2 62 6 

TTTTGTTCAG ATGCCGGCCC ACGGCCTATT CCCCTGGTGC GGCCTGCTGC TGGATACCCG 2 68 6 
GACCCTGGAG GTGCAGAGCG ACTACTCCAG CTATGCCCGG ACCTCCATCA GAGCCAGTCT 
CACCTTCAAC CGCGGCTTCA AGGCTGGGAG GAACATGCGT CGCAAACTCT TTGGGGTCTT 
GCGGCTGAA.G TGTCACAGCC TGTTTCTGGA TTTGCAGGTG AACAGCCTCC AGACGGTGTG 
CACCAACATC 7ACAAGATCC TCCTGCTGCA GGCGTACAGG TTTCACGCAT GTGTGCTGCA • 2 92 6 

GCTCCCATTT CATCAGCAAG TTTGGAAGAA CCCCACATTT TTCCTGCGCG TCATCTCTC-A 2 98 6 

.C.-.CGGCCTCC CTCTGCTACT CCATCCTGAA AG C C AAG AAC GCAGGGATGT CGCTGGGGGC 304 6 
CAAC-GGCGCC GCCGGCCCTC TGCCCTCCGA GGCCGTGCAG TGGCTGTGCC ACCAAGCA77 
CC7GC7CAAG CTC-ACTCGAC ACCGTGTCAC C7ACGTGCCA CTCCTGGGGT CACTCAGC-AC 
AC- CC C AG AC G CAGCTGAGTC GGAAGCTCCC GGGGACGACG CTGACTGCCC TGGAGGCCGC _ 3226 

ACCCAACCCG GCACTGCCCT CAGACTTCAA GACCATCCTG GACTGATGG.C CACCCGCCCA 32 8 6 

CAC-CCAGGCC GAGAGCAGAC ACCAGCAGCC CTGTCACGCC GGGCTCTACG TCCCAGGGAG 334 6 
GGAGGGGCGG CCCACACCCA GGCCCGCACC GCTGGGAGTC TGAGGCCTGA GTGAGTGTTT " 34 06 

GGCCGAGGCC TGCATGTCCG GCTGAAGGGT GAGTGTCCGG CTGAGGCCTG AGCGAGTGTC 34 66 
CAGCCAAGGG CTGAGTGTCC AGCACACCTG CCGTCTTCAC TTCCCCACAG GCTGGCGCTC .. 3526 

GGCTCCACCC CAGGGCCAGC TTTTCCTCAC CAGGAGCCCG GCTTCCACTC CCCACATAGG 358 6 

AATAGTCCAT CCCCAGATTC GCCATTGTTC ACCCCTCGCC CTGCCCTCCT TTGCCTTCCA 364 6 

CCCCCACCAT CCAGGTGGAG ACCCTGAGAA GGACCCTGGG AGCTCTGGGA ATTTGGAG7G 3706 

ACCAAAGGTG 7C-CCCTGTAC ACAGGCGAGG ACCCTGCACC TGGATGGGGG 7CCCTGTC-C-G 37 66 
TCAAATTGGG GGGAGGTGCT GTGGGAGTAA. AATACTGAAT A7A7GAGTTT 7TCAGTTT7G 
L^iiu _ ^ L_t_i_E_&_2-_A.i. AA-AAA-AA-AA 



3106 
3166 



3326 



id D d 
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(2) INFORMATION FOR SEQ iD NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 807 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

M*t o ro Arq Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
~1 5 10 15 

H^s Tyr A-a Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly 
' 20 . 25 - 30 

,r 0 Gin Giv Tro Ara Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
3 5 4 0 4S . 

pi a L*u Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
50 55 60 

d--. Pro A-U =ro Ser Phe Arg Gin "Val Ser Cys Leu Lys Glu Leu 

^65 " 70 7 5 80 

v a ; *ia A-c Val Leu Gin- Arg Leu Cys Glu Arg Gly Ala Lys Asn Val 
35 50 55 

t»u Ala oh» c-iv Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro 
100 . 105 110 

q : : ■ :i 2 ph, -hr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr 
115 120 12_5 

-so ■ a Leu Arc Glv Ser Gly .Ala Trp Gly Leu Leu Leu Arc Arg Val 
130 135 .140 

Giv Aso Aso Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val 
l.-:*5 ' - " 150 155 160 

V pi A - ->^ 0 ser Cys Ala Tvr Gin Val Cys Gly Pro Pro Leu Tyr 
165 1 7 0 . • 1 7 5 

r.H l-u Giw Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 

~ J VsO " 135 .150 

Fro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg 
. 195 ~ 200 205 . 

Glu Ala Glv Val- Pro Leu Gly Leu Pro Ala Pro' Gly Ala Arg Arg Arg 
210 " 215 , 220 . 

Glv GW S^r Ser Ara Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg \ 

_ : ?3S 2 4 0 

^3 230 

C-v Ala *1* o-o Glu Pro Glu Arc Thr Pro Val Gly Gin Gly Ser Trp 
245 250 255 

- - z r-i-./ sir-c ^hr Arc Gly Pro Ser Asp Arg Gly Phe Cys Val 




val Ser Pro Ala Arg Pro Ala GIu GIu Ala Thr Ser Leu Glu Giy Ala 

~ ^ ICC 



Id 



2S0 



Leu Ser Giy Thr Arg His Ser His Pro Ser Vai Giy Arg Gin His His 

290 295 300 

Ala Giy Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 

305 310 315 320 

Cys Pro- Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Giy 

325 330 335 

Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 

340 345 350 

Ser Leu Thr Giy Ala Arg Arg Leu Val Giu Thr He Phe Leu Giy Ser 

355 360 _ 365 

Arg Pro Tro Met Pro Giy Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin 



370 



375 



380 



--a Tv r Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Giy Asn His 

335 " 390 3.95 . 400 

Ala Gin Cvs Pro Tyr Giy Val Leu Leu Lys Thr His Cys Pro Leu Arg 
4 05 4 10 4 15 

Ala Ala Vai Thr Pro Ala Ala Giy Val Cys Ala Arg Giu Lys Pro Gin 
420 ^25 430 

Giy Ser Val Ala. Ala Pro Glu Giu Glu Asp Thr Asp Pro Arc Arg Leu 
4 25 440 445 

Vai Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr- Giy Phe 
450 455 - 460 

y*i p.-g £ i j Cvs Leu Arc Arc Leu Val Pro Pro Giy Leu Trp Giy Ser 
465 470 475 430 

Ar-j His Asn Glu Arc Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser 
485 450 495 

Leu Giy Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500 - 505 510 

Ser Val Arc Asp Cys Ala Trp.Leu Arg Arg Ser Pro Giy Val Giy Cys 
515 520 525 

Val Pro Ala Ala Giu His Arg Leu Arg Giu Glu lie Leu Ala Lys Phe 
530 535 540 

Leu His T-o Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe 
545 " 550 555 560 

Phe Ty r V pi Thr Glu Thr Thr. Phe Gin Lvs Asn Arc Leu Phe Phe Tyr 
565 ' 570 575 

Arc *Lvs Se- Val Tro Ser Lvs Leu Gin Ser -He Giy lie Arc Gin His 
580 585 590 
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Le»- Lvs Arc Vai Gin Leu Arg 

J ' =.oS 600 



Glu Leu Ser GIu Ala GIu vai Arc Gin 

605 



His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg ?he lie 



610 



615 



P-o Lys Pro Asp Gly Leu Arg Pro 
625 «0 



620 

He Vai Asn Met Asp Tyr Vai Vai 
635 640 



Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 



645 



650 



Arg Vai Lys Ala Leu Phe Ser Vai Leu Asn Tyr Glu Arg Ala Arg Arg 



660 



665 



Pro Gly Leu Leu Gly Ala Ser 
675 



Vai Leu Gly Leu Asp Asp He His Arg 



680 



685 



Ala Trp Arg Thr Phe Vai Leu Arg 



Vai Arg Ala Gin Asp Pro Pro Pro 



690 



695 



700 



Glu L*=>u Tyr Phe Vai Lys Vai Asp 

710 



Vai Thr Gly Ala Tyr Asp Thr He 



715 



720 



Gin ^so Arg Leu Thr Glu Vai 

725 730 



lie Ala Ser He He Lys Pro-Glh 

735 



Thr Tyr Cys Vai Arg Arg Ty 



yr Ala Vai Vai Gin Lys Ala Ala His 



740 



745 



750 



Glv His Vai Arc Lys Ala Phe Lys 



Ser His Vai Leu Arg Pro Vai Pro 



1 3 D 



760 



765 



r.-„ *so P-o Ala Glv Leu His Pro Leu . His Ala Ala Leu Gin. Pro Vai 

775 730 



! 70 



Arc Arc His Gly Glu Gin Ala 
-7*= " " 7 90' 



v-1 Cys Giy Asp Ser Ala Gly Arg 
795 . 800 



da Pro Ala Phe Giy Giy 
805 



[2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : . 4335 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) KOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) na;-:e/key: - 

(3) LOCATION:" 1. .4335 - 



(D) OTHER INFORMATION: /note= "genomic DNA insert- of pGRN14 4» 
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(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 2715.. 2818 

(D) OTHER INFORMATION: /note= "intron 1" 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 4173.. 4326 

" (D) OTHER INFORMATION: /note* "intron 2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CCATGGGACC CACTGCAGGG GCAGCTGGGA GGCTGCAGGC TTCAGGTCCC AGTGGGGTTG 
CCATCTGCCA GTAGAAACCT GATGTAGAAT CAGGGCGCGA GTGTGGACAC TGTCCTGAAT 
CTCAATGTCT CAGTGTGTGC TGAAACATGT AG AAAT T AAA GTCCATCCCT CCTACTCTAC 
TGGGATTGAG CCCCTTCCCT ATCCCCCCCC AGGGGCAGAG GAGTTCCTCT CACTCCTGTG 
GAGGAAGGAA TGATACTTTG TTATTTTTCA "CTGCTGGTAC TGAATCCACT GTTTCATTTG 
TTGGTTTGTT TGTTTTGTTT TGAGAGGCGG TTTCACTCTT GTTGCTCAGG CTGGAGGGAG 
7GCAATGGCG CGATCTTGGC TTACTGCAGC CTCTGCCTCC CAGGTTCAAG TGATTCTCCT 
GCTTCCGCCT CCCATTTGGC TGGGATTACA GGCACCCGCC ACCATGCCCA GCTAATTTT? 
7G7ATTTTTA GTAGAGACGG GGGTGGGGGT GGGGTTCACC ATGTTGGCCA GGCTGGTCTC 
GAACTTCTGA CCTCAGATGA TCCACCTGCC TCTGCCTCCT AAAGTGCTGG GATTACAGG7 
G7GAGCCACC ATGCCCAGCT CAGAATTTAC TCTGTTTAGA AACATCTGGG TCTGAGGTAG 
GAAGCTCACC CCACTCAAGT GTTGTGGTGT TTTAAGCCAA TGATAGAAT7 TTTTTATTGT 
T G T T AG AAC A CTCTTGATGT TTTACACTGT GATGACTAAG ACATCATCAG CTTTJCAAAG 
A C AC A C T AAC TGCACCCATA ATACTGGGGT GTCTTCTGGG TATCAGCGAT CTTCA.TTG.-_-. 
TGCCGGGAGG CGTTTCCTCG CCATGCACAT GGTGTTAATT ACTCCAGCAT AATCTTC7C-C 
TTCGATTTCT TCTCTTCCCT CTTTTAAAAT TGTGTTTTCT ATGTTGGCTT CTCTGCAGAG 
.AA.CCAGTGTA AGCTACAACT TAACTTTTGT TGGAACAAAT TTTCCAAACC GCCCCTTTGC 
CCTAGTGGCA GAGACAATTC ACAAACACAG CCCTTTAAAA AGGCTTAGGG ATCACTAAGG 
GGATTTCTAG AAGAGCGACC CGTAATCCTT AAGTATTTAC AAGACGAGGC TAACCTCCAG 
CGAGCGTGAC AGCCCAGGGA GGGTGCGAGG CCTG7TCAAA TGCTAAGCTT CCATAAATAA 
AGCAAATTTC CTCCGGCAGT TTCTGGAAAG TA.GGAAAGGT TAACATTTAA GGTTGCGTTT 
GTTAGCATTT CAGTGTTTGC CGACCTCAGC TP-AC AG CATC CCTGCAAGGC CTCGGGAGAC 
CCAGAAGTTT CTCGCCCCTT AGATCCAAAC TTGAGCAACC CGGAGTCTGG ATTCCTGGGA 
AGTCCTCAGC TGTCCTGCGG TTGTGCCGGG GCCCCAGGTC TGGAGGGGAC CAGTGGCCG7 



60 
120 
.180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114 0 
1200 
1260 
1320 
1330 
14 4 0 
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GTGGCTTCTA CTGCTGGGCT GGAAGTCGGG CCTCCTAGCT CTGCAGTCCG AGGCTTGGAG 1500 

CCAGGTGCCT GGACCCCGAG GCTGCCCTCC ACCCTGTGCG GGCGGGATGT GACCAGATGT 15 60 

TGGCCTCATC TGCCAC-ACAG AGTGCCGGGG CCCAGGGTCA AGGCCGTTGT GGCTGGTGTG 1620 

AGGCGCCCGG TGCGCGGCCA GCAGGAGCGC CTGGCTCCAT TTCCCACCCT TTCTCGACGG 1680 

GACCGCCCCG GTGGGTGATT AACAGATATT GGGGTGGTTT GCTCATGGTG GGGACCCCTT 17 4 0 

CGCCGCCTGA GAACCTGCAA AG AG AAAT G A CGGGCCTGTG TCAAGGAGCC CAAGTCGCGG 1800 

GGAAGTGTTG CAGGGAGGCA CTCCGGGAGG TCCCGCGTGC CCGTCCAGGG AGCAATGCGT 18 60 

CCTCGGGTTC GTCCCCAGCC GCGTCTACGC GCCTCCGTCC TCCCCTTCAC GTCCGGCATT 1920 

CGTC-GTGCCC GGAGCCCGAC GCCCCGCGTC CGGACCTGGA GGCAGCCCTG GGTCTCCGGA 1980 

TCAGGCCAGC GGCCAAAGGG TCGCCGCACG CACCTGTTCC CAGGGCCTCC ACATCATGGC 2040 

CCCTCCCTCG GGTTACCCCA CAGCCTAGGC CGATTCGACC TCTCTCCGCT GGGGCCCTCG 2100 

CTGGCGTCCC TGCACCCTGG GAGCGCGAGC GGCGCGCGGG CGGGGAAGCG CGGCCCAGAC . 2160 

CCCCGGGTCC GCCCGGAGCA GCTGCGCTGT CGGGGCCAGG CCGGGCTCCC AGTGGATTCG 22 20 

CGGGCAACAG ACGCCCAGGA CCGCGCTTCC CACGTGGCGG AGGGACTGGG GACCCGGGCA 2 280 

CCGGTCCTGC CCCTTCACCT TCCAGCTCCG CCTCGTCCGC GCGGAACCCC GCCCCGTCCC 2 34 0 

GAACCCTTCC CGGGTCCCCG GCC.CAGCCCC TTCCGGGCCA TCCCAGCCCG TCCCGTTCCT 2 4 00 

777CCGCGGC CCCGCCCTCT CCTCGCGGCG CGAGTTTCAG CCAC-CGCTGC GTCCTGCTGC 2 4 60 

G C AC G T G G G A AGCCCTGGCC CCGGCCACCC CCGCGATGCC GCGCGCTCCC CGCTGCCGAG 

CCGTGCGCTC CCTGCTGCGC AGCCACTACC GCGAGGTGCT GCCGCTGGCC- ACGTTCGTGC 

CGCGCCTGGG GCCCCAGGGC TC-GCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

CC-CTGGTGGC CCAGTGCCTG GTGTGCGTGC CCTGGGACGC ACGGCCGCCC CCCGCCGCCC 

CCTCCTTCCG CCAGGTGGGC CTCCCCGGGG TCGGCGTCCG GCTGGGGTTG AGGGCGGCCG 27 60 

GGGGGAACCA GCGACATGCG GAGAGCAGCG CAGGCGACTC AGGGCGCTTC CCCCGCAGGT '28 20 

GTCCTGCCTG AAGGAGCTGG TGGCCCGAGT GCTGCAGAGG CTGTGCGAC-C GCGGCGCGA.-. 2 880 

G.-.-.CGTGCTG GCCTTCGGCT TCGCGCTGCT GGACGGGGCC CGCGGGGGCC CCCCCGAGGC 2940 

CTTCACCACC AGCGTGCGCA GCTACCTGCC CAACACGGTG ACCGACGCAC TGCGGGGGAG 

i 

CGC-GGCGTGG GGGCTGCTGC TGCGCCGCGT G G G C G AC G AC GTGCTGGTTC ACCTGCTGGC 

\ 

ACGCTGCGCG CTCTTTGTGC TGGTGGCTCC CAGCTGCGCC TACCAGGTGT GCGGGCCGCC 3120 

GCT3TACCAG CTCGGCGCTG CCACTCAGGC CCGGCCCCCG CCACACGCTA GTGGACCCCC- 3180 

.-.-.GGCGTCTG GGATGCGAAC GGGCCTGGAA CCATAGCGTC AGGGAGGCCG GGGTCCCCCT 32 4 0 

GGGCCTGCCA GCCCCGGGTG CC-AGGAGGCG CGGGC-GCAGT GCCAGCCG.-A. GTCTGCCGTT 3300 




2 520 
2580 
2640 • 
2700 



3000 
3060 
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GCCCAAGAGG 
GTCCTGGGCC 
ACCTGCCAGA 
CTCCCACCCA 
ACGTCCCTGG 
CTCAGGCGAC 
GACTGGCGCT 
GACTCCCCGC 
GGAGCTGCTT 
GCTGCGAGCT 
T-GTGGCGGCC 
GCACAGCAGC 
CCCAGGCCTC 
CATCTCCCTG 
GCGGGACTGC 
AGGCCCCAGA 
C'-TGTCTCCA 
r-'-.-Trr.r.r': 



CCCAGGCGTG 
CACCCGGGCA 
CCCGCCGAAG 
TCCGTGGGCC 
GACACGCCTT 
AAGGAGCAGC 
CGGAGGCTCG 
AGGTTGCCCC 
GGGAACCACG 
GCGGTCACCC 
CCCGAGGAGG 
CCCTGGCAGG 
TGGGGCTCCA 
GGGAAGCATG 
GCTTGGCTGC 
GCTGAATGCA 
TCGTCACGTG 
TCGAC 



GCGCTGCCCC 
GGACGCGTGG 
AAGCCACCTC 
GCCAGCACCA 
GTCCCCCGGT 
TGCGGCCCTC 
TGGAGACCAT 
GCCTGCCCCA 
CGCAGTGCCC 
CAGCAGCCGG 
AGGACACAGA 
TGTACGGCTT 
GGCACAACGA 
CCAAGCTCTC 
GGAGGAGCCC 
GTAGGGGCTG 
GG G AC ACGTG 



TGAGCCGGAG 
ACCGAGTGAC 
TTTGGAGGGT 
CGCGGGCCCC 
GTACGCCGAG 
CTTCCTACTC 
CTTTCTGGGT 
GCGCTACTGG 
CTACGGGGTG 
TGTCTGTGCC 
CCCCCGTCGC 
CGTGCGGGCC 
ACGCCGCTTC 
GCTGCAGGAG 
AGGTGAGGAG 
AGAAAAGGGG 
GCTTTTCGCT 



CGGACGCCCG 
CGTGGTTTCT 
GCGCTCTCTG 
CCATCCACAT 
ACCAAGCACT 
AGCTCTCTGA 
TCCAGGCCCT 
CAAATGCGGC 
CTCCTCAAGA 
CGGGAGAAGC 
CTGGTGCAGC 
TGCCTGCGCC 
CTCAGGAACA 
CTGACGTGGA 
GTGGTGGCCG 
GCAGGCAGAG 
CAGGACGTCG 



T i'GGGC AGGG 
GTGTGGTGTC 
GCACGCGCCA 
CGCGGCCACC 
TCCTCTACTC 
GGCCCAGCCT 
GGATGCCAGG 
CCCTGTTTCT 
CGCACTGCCC 
CCCAGGGCTC 
TGCTCCGCCA 
GGCTGGTGCC" 
CCAAGAAGTT 
AGATGAGCGT 
TCGAGGGCCC 
CCCTGGTCCT 
AGTGGACACG 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
. 4 260 
4 320 
4 335 



\2) INTONATION FOR SEQ ID MO : 7 : 

(i) sequence: CHARACTERISTICS : 

(A) LENGTH: 240 base pairs 
(3) TYPE: nucleic acid 
. (C) STRA.NDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(i>:) FEATURE: 

(A) NAME/ KEY : intron 
(3) LOCATION : 95 . . 1SS 

(D) OTHER INFORMATION: /note- "intron. 1" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
;gacccggcg GCTTTCCGCG CGCTGGTGGC CCAGTGCCTG GTGTGCGTGC CCTGGGACGC 
^.CGGCCGCCC CCCGCCGCCC CCTCCTTCCG CCAGGTGGGC CTCCCCGGGG TCGGCGTCCG 
:C?GGGGTTG AGGGCGGCCG GGGGGAACCA GCGAGATGCG GAGAGCAGCG CAGGCGACTC 




60 
120 

180 
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AGGGCGCTTC CCCCGCAGGT GTCCTGCCTG AAGGAGCTGG TGGCCCGAG7 GCTGCAGAGG 24 0 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix.) FEATURE: 

(A) NAME/KEY; - 

(B) LOCATION: 1 . . 389 

(D) OTHER INFORMATION: /note- "expressed sequence tag (EST) 

AA281296" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG GTCTTTCTTT 60 

7A7GTCACGG AGACCACGTT T C AAAAG AAC AGGCTCTTTT TCTACCGGAA GAGTGTCTGG 120 

AGCAAGTTGC AAAGCATTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT GCGGGACGTG 180 

7CGGAAGCAG AGGTCAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC GTCCAGACTC 240 

CG7T7CATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA AC AT GG ACT A CGTCGTGGGA 300 

G7CAGAACG7 TCCGCAGAGA AAAGAGGGCC GAGCGTCTCA CCTCGAGGGT GAAGGCACTG 360 

Z~.Z CG7GC 7 C AAC 7 AC G A GCGGGCGCG . . 33 9 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 182 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(3) LOCATION: 1 . . 132 

(D) OTHER INFORMATION: /note= "182 basepair sequence deleted in 

clone 712562" 



(xi) SEQUENCE DESCRIPTION:. SEQ ID NO : 9 : 
fCTACCTTGA CAGACCTCCA GCCGTACATG CGACAGTTCG TGGCTCACCT GCAGGAGACC 60 

-.GCCCGCTGA GGGA7G CCGT CGTCA7CGA.G CA.GAGCTCCT CCCTGAA7GA GGCCA.GCAG7 120 
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3GCCTCTTCG ACGTCTTCCT ACGCTTCATG TGCCACCACG CCGTGCGCA7 CAGGGGC- 



180 
18 2 



TC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 



IS! SS"S«lii"»= /«.«- "protein encoded by clone ,12«2' 



\y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr 

C 10 



1 b 
Glu Thr Thr Phe Gin Lys Asn Arc Leu Phe Phe Tyr Arg Lys Ser Val 

20 25 
T ro Ser Lys Leu Gin Ser lie Gly He Arg Gin His Leu Lys Arg Val 

35 <° <J . 

Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His Arg Glu Ala 

50 bb _ 

Arg' Pro Ala Leu Leu Thr Ser Arg ifeu Arg Phe lie Pro Lys Pro jap 
65 "70 7 ° 

Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly Ala-Arg Thr 
■ 85 90 

Phe Arc Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala 
100 lQo 

-. „ n Ala P-ra -*-q Pro Gly Leu Leu 
Leu Phe Ser Val Leu Asn iyr Glu .-..q Aia J-..g .-g * . 

115 . I 20 ; 123 



Giy Ala Ser Val Leu Gly Leu Asp Asp He His Arg Ala Trp Arg Thr 

130 ' 135 

Phe val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu Leu Tyr Phe 
145 I 50 , 

, n.. N „ »ij T,:r Aso Tpr He Pro Gin Asp Arg 
Val Lvs Val Asp Val Tnr v,ly .-.la - 1- - Asp .... 

165 170 

~ ■ n w=i tip Ala S = r Tie He Lys Pro Gin Asn Thr Tyr Cys 
Leu Tnr Glu Val He Ala s-r --<= y i 90 

180 183 




26 



Val Arc Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly His 
" 195 200 205 



Lvs Ala Phe Lys Ser His Val Leu Arg Pro Val Pro Gly Asp Pro Ala 
210 215 220 

Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val Leu Arg Arg His 
225 230 235 240 

Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg Ala Ala Pro Ala 
245 250 255 

Phe Gly Gly 



(2) INFORMATION FOR SEQ ID NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS :- 

(A) LENGTH: 34 amino acids. 

(B) TYPE: amino acid 

(C) . STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE T YPE : - pept ide 



(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
(3) LOCATION: 2 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = Leu or He" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f iec-si te_ 

(5) LOCATION: 10 ^ 

£D5 OTHER INFORMATION: /prccruct = "OTHER" 

/noL£= "Xaa = - Leu or- He" 

(ix) FEATURE: 

(A) NAME/ KEY : Modi f ied-site 
( 5 ) LOCATION: 11 

(D) OTHER INFORMATION: '/product* "OTH-R" 

/note- "Xaa = Leu or He" 

(ix) FEATURE: 

(A) " NAME/KEY : Modi f ied-s it e 
(3) LOCATION : 12 

(D) OTHER INFORMATION: ' /product* "OTHER" 

/note= "Xaa = Gin. or Arg" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i te 
(3) LOCATION: 28 

(D) OTHER INFORMATION: /product = "OTHER" 

/note= "Xaa = Phe or Tyr" 

(ix) FEATURE : 

(A) NAME/ KEY : Modi f ied-s i te . 
(3) LOCATION: 2 9 

(D) OTHER INFORMATION: /product "OTHER" 

/ n o z. e = " X £ <= = P r : e or Tyr 

201 



(ix) FEATURE: 

(A) NAME/ KEY : Modif led-site 

(3) LOCATION: 31 ,„„„„ 

(D) OTHER I N FORMAT I ON : /product- OTH^R 

1 ' /note= "Xaa = Lys or His 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

. Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Trp. 
xaa Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa Xaa 



20 25 30 



Xaa Trp 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



ii) MOLECULE TYPE: peptide 



(i 



(ix) FEATURE: 

(A) NAME/ KEY : Modi fied-site 

(3) LOCATION: 2 Il - T ,^„ 

(D) OTHER INFORMATION: /procucu- OTrt^H 

■ ' /.note- "Xaa = Leu or He' 

(ix) FEATURE: . " 

(A) NAME/ KEY : Modi £ ied-s l te 
(3) LOCATION: 10 n 
(D) OTHER INFORMATION: /product = ."OTHlR 



/note= "Xaa = Leu or lie' 



(ix) FEATURE: 

(A) NAME/KEY: Modi f i ed- s i t e 

(3) LOCATION: II ..^..-o" 

(D) OTHER INFORMATION: /product- OTr^R - 

1 /note= "Xaa = Leu or lie 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 12 

(0) OTHER INFORMATION: /product- "Om^ 

K ' * ' /note- "Xaa = Gin or Arc 

(ix) FEATURE: 

(A) NAME/ KEY : Modi f ied-si te 

(B) LOCATION: 29 lt „.^ 0 „ 
fD) OTHER INFORMATION: /procucu- Oin-., 

/note- "Xaa = Phe or lyr 



28 



(ix) FEATURE: ' ^ 

(A) NAME/KEY: Modi f iec-s it e 
(3) LOCATION: 30 

(D) OTHER INFORMATION: /product= "OTHcR 

/note= "Xaa = Phe or Tyr 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 32 

(D) OTHER INFORMATION : /product= "OTHER 

/note= "Xaa = .Lys or His 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Trp 



Xaa Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa 
20 



25 30 



Xaa Xaa Trp 
35 



(2) INFORMATION FOR SEQ ID NO : 1 3 : 

(i) SEQUENCE .CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 
*{3) TYPE: amino acid 

(C) -.ST RAN DE DN ES S : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(3) LOCATION: 1 . . 1 2 9 , 

(D) OTHER INFORMATION: /note- "TRT motirs rro* num.n 



(xi) SEQUENCE DESCRIPTION: SEQ ID- MO: 13: 



Ma Lvs Phe Leu His Trp l< 
1 5 " 

Arg Ser Phe Phe Tyr Val Th 



>u M-t Ser Val Tyr Val Val Glu Leu Leu 
10 15 

r Glu Thr Thr Phe Gin Lys Asn Arc Leu 



20 



Phe Phe Tyr Arg Lys S 



25 



30 



er Val Trp Ser Lys Leu Gin Ser lie Giy He 



40 



4 5 



Arc Gin His Leu Lys Arg Val Gin 



Leu Arg Glu Leu Ser Glu Ala Glu 



50 



55 



Vai~Ara Gin His Arg Glu Ala Arg Pro 



60 



Ala Leu Leu Thr Ser Arc Leu 



65 



70 



75 



80 



, ro Phe lie Pro Lys Pro Asp Giy Leu Arg 

r. z 90 



Pro He Val Asn Met Asp 

c - 
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Tvr Val Val Gly Ala Arc Thr Phe Arg Arg Glu Lys Arg Ala Giu Arg 



100 



105 HO 



Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg 
115 120 125 



Ala 



2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix)" FEATURE: 

(A) .NAME/KEY: Peptide 
(3) LOCATION: 1 . .233 

(D) OTHER INFORMATION: /note= "TRT motifs from 

Schizosaccharorayces pombe tezl' 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

■ He Giu He Glu Tro Leu Val Leu Gly Lys Arg Ser Asn Ala Lys 

1 5 10 15 

M-t Cvs Leu Ser Aso Phe Glu Lys Arg Lys Gin He ?h.e Aia Glu Phe 
' 20 25 30 

Tie t v - Tr D Leu Tyr Asn Ser Phe lie lie Pro lie Leu Gin Ser Phe 
' 35- 40 <5 

o he t y r- Tie Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe 
5b" " -55 60 

arq L »s ^so He Tro Lvs Leu Leu Cys Arg Pro Phe He Thr Ser Met 
65 •" * "lb " 30 

Lys Met c-lu Ala Phe Glu Lys He Asn Glu Asn Asn Val Arg Met Asp 
85 90 95 

Thr G m Lys Thr Thr Leu Pro Pro Ala Val He Arg Leu Leu Pro Lys 
100 105 HO 

Lvs Asn Thr Phe Arg Leu I Is • Thr- Asn Leu Arc Lys Arg Phe. Leu lie 
115 120 125 

Lys w=t Gly Ser Asn* Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leo 
"l30 135 1^0 

Arc s-o Val Ala Ser He Leu Lys His Leu He Asn Glu Giu Ser Ser 
145 150 155 160 

Gi v Tie or 0 Phe fsn Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys 
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Lvs tso Leu Leu Lvs His Arg Met ?he Gly Arg Lys Lys Tyr Phe Val 
Y " ' 180 185 190 

A-q Tie Asp He Lys Ser Cys Tyr Asp Arg He Lys Gin Asp Leu Met 
195 200 205 

Phe A-g He Val Lys Lys Lys Leu Lys Asp Pro Glu Phe Val lie Arg 
210 215 220 

Lys Tyr Ala Thr He His Ala Thr Ser 
225 230 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) E-'.OLECULE TYPE: peptide 



(ix.) FEATURE : • 
(A) NAME/ KEY: Peptide 
(3)" LOCATION: 1 . . 233 

(D) OTHER INFORMATION: /note- "TRT motifs from Saccna romyces 

cerevisiae EST2" 



!:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

L-u 'vs Aso Phe Arg Tr= Leu Phe- lie Ser Asp He Trp Phe Thr Lys 

r "* s . ■ _ io 

His -s- Ph- Glu Asn Leu Asn Gin Leu Ala He Cys Phe He Ser Trp 

20 ■■ 25 30 

Leu Arc Gin Leu He Pro Lys lie He -Gin -Thr Phe Phe Tyr Cys 

" ■ " W 40 45 - 



Th- G'u II- Se- Ser Thr Val Thr He Val Tyr Phe Arg His Asp 
' 50 ~ 55 60 

fro Asn Lys Leu He Thr Pro Phe He Val Glu Tyr Phe Lys Thr 
65" .70 15 ■ 



Thr 



Tyr 
80 



Leu Val Glu Asn Asn Val Cys. Arg Asn 
8 5 



His Asn Ser Tyr Thr Leu Ser 
90 9 = 



Asn Asn His Ser Lys Met Arg He He Pro Lys Lys Ser 

100 1° 5 110 

\ 

• G i u fro He He Ala He Pro Cys Arg Gly Ala Asp Glu G1-. 

115 - ■ 120 125 . 

Phe Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro Th: 
130 135 "0 



= r. Asn 



Glu 



Gin 





31 



Lys lie Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys 
145 150 

Ile Tyr Ser Pro Thr Gin He Ala Asp Arg lie Lys Glu Phe Lys Gin 



165 

Arc Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met 

180 18b 
Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser lie Pro Arg Met- Glu Cys 

195 200 
Met Arg lie Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val 



210 



215 



Arg Ser Gin Tyr Phe Phe Asn Thr Asn 
22 5 230 - 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(3) LOCATION: 1..2 33 _ „ r-^i^-oc 

(D) OTHER INFORMATION: /no.e- TRT mOL-~s 

aediculatus o12j>" 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



~hr Arg 



Glu Ile Ser Trp M< 



it Gin Val 



Glu Thr Ser Ala Lys His Phe 



10 



15 



ou a riu tsn T'. e Tyr Val Leu Trp Lys Leu Leu Arg 

Tvr Tyr Phe Asp ms Glu Msn --e i/- ^. 



20 



25 



30 



w-i \/ a i q D r Ten I i e Arg Cys Phe Phe Tyr 

,Tr 0 lie Phe Glu Asp Leu -Val. Val Ser Leu.i-_ r-..y / 

35- 40 - " 

Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 



55 



Val Thr Glu Gin Gin Lys 
50 

. . ^ c^r- Tic i i p. A^o Leu Lys Lvs 
Asn Tie Tro Aso Val Ile Met lvs i.et Ser lie — ? , ^ 

70 10 



65 



Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu Glu Tro Lys Lys 

8 D 



Ser Leu Gly Phe Ala Pro Gly Lys 



Leu Arg Leu He Pro Lys Lys Thr 



100 



105 



110 





32 



Thr Phe Arg Pro lie Met Thr Phe Asn Lys Lys ne vai 



Val Asn Ser Asp 



115 



120 



Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu Asn Ser His 

130 135 
Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys Asp Pro Phe Gly 



145 



150 



Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr Glu Glu Phe 
165 170 

val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe Phe Ala Thr 

180 185 



Met Asp He Glu Lys Cys Tyr Asp Ser . 

195 200 * 

Thr Phe Leu Lys Thr Thr Lys Leu Leu Ser Ser Asp Phe Trp lie Met 



Val Asn Arg Glu Lys Leu Ser 



210 



215 



220 



Thr Ala Gin lie Leu Lys Arg Lys Asn 
225 . 230 

:2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE : amino acid 
-. (C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 4 

(D) OTHER INFORMATION: /note* " cons ; 

from motif T" 



nsus telomerase RT sequence 



(ix) FEATURE: 

{A} NAME /KEY: Modi f ied-site 

(B) LOCATION: .1 •* pit u d " ' 

(D) OTHER INFORMATION: /procuct- OTric . . ■ 
{ - }} w /note* "Xaa - polar ammo acic, Gly, 

Ser, Thr, Tyr, Cys, Asn or Gin" 



<i) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Xaa Phe Phe Tyr 
1 




(2) INFORMATION FOR SEQ ID NO: IS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE-: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 7 
(D) OTHER INFORMATION 



(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-site 
■(B) LOCATION: 1 
(D) OTHER IN FORMAT ION : /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro,- Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i te 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
■ or Met" 



(xi)- SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

v aa £. rg x ea He Pro Lys Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO : 1 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:. 

(D) -TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
* (B) LOCATION: 1 . . 4 

(D) OTHER INFORMATION: /no:e= "consensus telomerase *T sequence 

from motif 2" 



: /note= "consensus telomerase RT sequence 
from motif 1" 
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(ix) FEATURE: 

(A) NAME /KEY : Modi £ led- s i te 

(B) LOCATION: 3 ( ' 

(D) OTHER INFORMATION: /product^ "OTHER" ^ 

/note- "Xaa = hydrophobic amino acia, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
Phe Arg Xaa He 

1 ■ ' 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1..6 

(D) OTHER INFORMATION: /note- "consensus telonerase Ri sequence 

from motif A" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s i te 
(3) LOCATION: 2 

(D) OTHER INFORMATION: /products "OTHER" • 

/ no te= "Xaa = charged amino acid, Asp, 
Glu, His, Lys or Arg" 



(ix) .FEATURE: 

(A) NAME /KEY : Modi f ied-site 
(2)- LOCATION: 6 

(D) OTHER INFORMATION: /product- "OTKr-R" 

/not e= ' "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp- 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:20: 

Pro Xaa Leu Tyr Phe Xaa 
1 5 - 



(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i). SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: ' 

(A) NAME/KEY: Peptide 



\l\ SSrS^IOH: /not- «l«r... »T .«■»"<=. 

from motif B' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

Gly He Pro Gin Gly Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 



'SI SSK'IXoUJw*. /not,- -consent W 



(D) ^ . „ 

i rom motif C. 



SEQUENCE' DESCRIPTION: SEQ ID NO : 2 2 : 
Leu Leu Arc Leu 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS.: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(£) N^ME/KEY: Peptide 

IS! S?5.S"?5ro^i.».= /not- »!=-«... ST ..,u.nc. 

from motif C 
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(ix) FEATURE: . 

(A) NAME /KEY : Modi f ied-site 
(3) LOCATION: 5 

(D) OTHER INFORMATION: /product^ "OTHER" _ ^ 

/note= "Xaa = hydrophobic amino acia, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Asp Asp Phe Leu Xaa lie Thr 
1 5 



2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: .48 amino acids 

(3) TYPE: amino aci.d 

(C) STRANDEDNESS: 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix") FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. . 4 8 

(D) OTHER INFORMATION: /note- "motif T peptide from 

Schizosaccharomyces pombe TRT tezlp 



xi) SEQUENCE DESCRIPTION :- SEQ ID NO: 24: 



i rp 



L*u Tv- Psn'ser Phe lie lie Pro lie Leu Gin Ser Phe Phe Tyr 
c - - 10 15 



Ilo Thr Glu Ser Ser Aso Leu Arg Asn Arg Thr Val Tyr Phe.Arg Lys 

20 25 30 

*so Ti« Tro Lys Leu Leu Cys Arg Pro" Phe lie Thr Ser Met Lys Met 
" ' 35' .40 45 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids'- 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii.) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: I . . 54 
(D) OTHER INFORMATION 




/note= "motif i and 2 peptide from 
Schizosaccharomyces pc~.be TRT zazlp' 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Asn Asn Val Arg Met Asp Thr Gin Lys Thr Thr Leu Pro Pro Ala Val 

1 5 10 ■• 

lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu lie Thr Asn Leu 

Arg Lys Arg Phe Leu lie Lys Met Gly Ser Asn Lys Lys Met Leu Val 

35 4 0 4 ^ 



Ser Thr Asn Gin Thr Leu 
50 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) ST HANDEDNESS-: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(3) LOCATION: 1 . . 34 . ... 
(D) OTHER INFORMATION : /note- "motif A peptice rrom ^ 

Schizosaccharomyces pombe iRi tezlp 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 26: 

> Asd lie Lvs Ser 
10 * * 15 



Phe Gly Arc Lys Lys .Tyr Phe Val Arc lie Asp He Lvs Ser Cys Tyr 



5 



-■ tu Lys Gin Asd Leu Met Phe Arg. lie Val Lys Lys Lys Leu 
4 20 25 ~ 30 



,ys As: 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:-.) FEATURE: 

(A) NAME/ KEY : Peptide 

(3) LOCATION : 1 . . 35 . 

(D) OTHER INFORMATION: /note- "motif B' peptide ^rom 

Schizosaccharomyces pombe TRT cezlp 



3S 



(xi) SEQUENCE: DESCRIPTION: SEQ ID NO: 27: 

Ser G 1 n Tyr Leu Gin Lvs Val Gly lie Pro. Gin Gly Ser lie Leu Ser 
1 ~ 5 10 15 

Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu 
20 .25 30 

Ser Phe Thr 
35 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

"(ix) FEATURE: . _ " ' 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 4 3 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

Schizosaccharomyces pombe TRT tezlp' 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Leu L— &rg val Val Aso Asd Phe Leu Phe lie Thr Val Asn Lys Lys 
1 - 5 . ' ■ 10 15 

Aso Lys Lys Phe Leu Asn Leu Ser Leu Arc Gly Phe Glu Lys His 

20 ' ■ 25 30 . 

Asn Phe Ser Thr Ser Leu Glu Lys Thr Vai lie 
35 40 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 
(3)- TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

■ (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1 . . 11 " 

(D) OTHER INFORMATION: /note= "motif E peptide from 

Schizosaccharomyces pcmbe TRT tezlp' 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
Lys Lys Arg Met Pro Phe Phe Gly Phe Ser Val 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

!o! SES'S^SoN. •/»«.- -~ti« T peptide t.m *-» TRT- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
Trp Leu Met Ser Val Tyr Val Vai Giu Leu Leu Arg Ser Phe Phe Tyr 

Val Thr Giu Thr Thr Phe Gin Lys Asa Arg Leu Phe Phe Tyr Arg Lys 

20 2o 
Ser vai Tro Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu Lys 
" 35' 4° 



(2) INFORMATION FOR SEQ ID NO : 3.1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino 'acids 
(3) TYPE: amino acid 

(C) . STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

!?, /note- -~t« 1 » P.P"« «<~ 

huraan TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: .-• 

Giu Val Arc Gin His Arg Giu Ala Arg Pro Ala Leu Leu Thr Ser Arg 
1 5 10 

Lvs Pro As? Gly Leu Arg Pro He Val Asn Met 



Leu Arc Phe He Pro Ly 



30 




40 



Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu 
35 40 45 

Arg Leu Thr Ser Arg Val 
50 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 34 - 

(D) OTHER INFORMATION: /note= "motif A peptide from human TRT " 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 32: 

Pro P*-o Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr 
1 5 10 15 

Aso Thr He Pro Gin Asp Arg Leu Thr Glu Val lie Ala Ser He He 
20 25 30 

Lvs Pro 



{2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

"(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
CD) TOPOLOGY: linear 

(ii) KOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: /note= "motif 3' peptide from human TRT" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Lys Ser Tyr Val Gin Cys Gin Gly He Pro Gin Gly Ser He Leu Ser 
' 1 5 10 15 

Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe 
20 25 30 
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Ala Gly lie 
35 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .43 . . 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr 
1 5 10 15 

His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr 
20 25 30 

Gly Cys Val Val Asn Leu Arg Lys Thr Val Val 

35 .40 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 
(E) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : * . . " 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note= . "motif E peptide- from hunan TRT' 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 35: 

His Gly Leu Phe Pro TrD Cvs GIv Leu Leu Leu 
1 5 10 



0Q. 
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2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 48 amino acids 

( B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..4 8 

(D) OTHER INFORMATION: /note= "motif T peptide from Euplotes 

aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Trp He Phe Giu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 

1 5 . 10 15 

Val Thr Giu Gin -Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 

20 25 _ _ 30 

Asn He Trp Asp Val He Met Lys Met Ser lie Ala Asp Leu Lys Lys 

35* 40 45 



2} INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS : 

(At) LENGTH: 54 amino acids 
(3) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION : 1 . . 54 

(D) OTHER INFORMATION: /note= "motif 1 end 2 peptide from 

Euolotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Lys Giu Val Giu Giu Trp Lvs Lys Ser Leu Gly Phe Ala Pro Gly Lys 

. 1 .5 " 10 15 

Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arc Pro He Met Thr Phe 
20 25 30 

Asn Lys Lys lie Val Asn Ser Asp Arg Lys Thr Thr Lys Leu -Thr Thr 
35 40 45 



Asn Thr Lys Leu Leu Asn 
50 



43 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .34 

(D) OTHER INFORMATION: /note- "motif A peptide from Euplotes 

aediculatus pl23" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Giy Gin Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr 
1 5 10 _ 15 

Asd Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
20 25 _ 30 . 

Leu Leu 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(3) LOCATION: 1 . . 35 • 
(D) OTHER INFORMATION: /note- "motif B' peptide from Euplotes 

aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys Val Ser 
1 5 10- . 15 

S or- T le Leu S-r Ser Phe Tvr Tyr Ala Thr Leu Glu Glu Ser Ser Leu 
20 25 30 

Giy Phe Leu 
35 






(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
• (B) LOCATION: 1. .43 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

Eupiotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: . 

Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu lie Thr Thr Gin Glu Asn 
1 5 10 15 

Asn Ala Val Leu Phe He Glu Lys Leu He Asn Val Ser Arg Glu Asn 
20 " 25 30 

Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr 

35 40 



(2) INFORMATION ' FOR SEQ ID NO: 41: 

SEQUENCE CHARACTERISTICS: 
tA) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMZSS : - 

(D) TOPOLOGY : linear 

MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 11 ' 

(D) OTHER INFORMATION: /note= "motif E peptide from Eupiotes 

aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Gin Aso Tyr Cys Asp Trp He Gly He Ser He 
1*5 10 



(2) INFORMATION FOR SEQ ID NO : 4 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(i) 



(ii) 




p 



45 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..47 

(D) OTHER INFORMATION: /note- "motif T peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Trp Leu Phe Arg Gin Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr 
1 5 10-15 

Cys Thr Glu lie Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp 
.20 25 30 

Thr Trp Asn Lys Leu lie Thr Pro Phe He. Val Glu Tyr Phe Lys 
35 4.0 45 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid . 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION : 1 . . 2 6 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Saccharomyces cerevisiae EST2p M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Cys Arc Asn His Asn Ser Tyr Thr Leu Ser Asn Phe Asn His Ser Lys 
1 * 5 . 10 10 

Met Arg He He Pro Lys Lys Ser Asn Asn 
20 * 25 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 29 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



m m 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..2 9 

{ D) OTHER INFORMATION: /note= "motif 2 peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Phe Arg lie lie Ala lie Pro Cys Arg Gly Ala Asp Glu Glu Glu Phe 
1 5 10 15 

Thr lie Tyr Lys Glu Asn His Lys Asn Ala lie Gin Pro 
20 25 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
"(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 34 

(D) OTHER INFORMATION: /note= "motif" A peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 45: 

Val Leu Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr 
I 5 10 15 

Asp Ser lie Pro Arg Met Glu Cys Met Arg lie. Leu Lys Asp Ala Leu 
20 25 30 

Lys Asn ■ 



(2) INFORMATION FOR SEQ ID NO : 4 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 35 
(D) OTHER INFORMATION: 



/note= "motif 
Saccharomyces 



B 1 peptide 
cerevisiae 




from 
EST2p" 




47 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:46: 
Lys Cys Tyr lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser 



Ala Pro lie Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser 
20 25 30 

Glu Phe Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 47: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(D) OTHER^ INFORMATION : ./note- "motif- C ^nd D peptide from 

Saccharomyces cerevisiae EST2p 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie Ser Thr Asp. Gin Gin 
1 5 10 

Gin Val lie Asn lie Lys Lys Leu Ala Met Gly Gly Phe Gin Lys. Tyr 
20 25 30 

^sn Ala Lys Ala Asn Arg Asp Lys lie Leu Ala 
35 " 40 



(2) INFORMATION FOR SEQ ID NO : 4 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(3) LOCATION: 1..11 . _ " , . . 

D) OTHER INFORMATION : /note- "motu ^ peptide from _ 

Saccharomyces cerevisiae EST2p 



— 4 8 



<xi). SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Lys Glu Leu Glu Val Trp Lys His Ser Ser Thr 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 



(ix) FEATURE: . " 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 4 

(D) OTHER INFORMATION: /note= "consensus non-telomerase RT - 

sequence from motif B 1 " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-s ite 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, . Leu., lie, Val, Pro, Phe, Trp 
or Met" 



(:<i) SEQUENCE DESCRIPTION : SEQ ID NO: 4 9: 

Xaa Pro Gin Gly 
1 * 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
{ D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
■(B) LOCATION: 1.-5 

(D) OTHER INFORMATION: /note= "consensus non-telomerase RT 

sequence from motif C" 




